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The concern of this chapter is a collection of methods 
for the evaluation of a finite sum whose summands are 
given as a sequence, either in a functional form f(k), or 
in a subscripted form x;. Analogous to the sense in which 
a real function may have for its integral over an interval 
an anti-derivative function evaluated at the bounds of the 
interval, the value of such a finite sum may be given by 
some other function of the lower and upper limits of the 
index k. 


REVIEW FROM 81.4: 


e Let (z,,) be a sequence. Then the expression 
ag; = Spe en sb ee ea (3.0.1) 
j=0 


(and also its value) are called the n‘“ partial sum. 


NOTATION: Sometimes S,, denotes the n™ partial sum. 
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3.1 NORMALIZING SUMMATIONS 


DEF: A consecutive summation is an expression 


b 
> 
k=a 


where a and 6 be integers or integer-valued variables, and 
where (z,,) is a sequence with its values in an algebraic 
structure (e.g., the integers, reals, or complex numbers) 
with an associative and commutative addition. Its value, 
the sum, is defined recursively. 


‘ 0 ifb<a 

S- ee ifb=a 
Lp = 

k=a ( x) +a, ifb>a 


The parameters of the expression have names: 
e k is called the index variable; 
e ais called the lower limit of the index; 


e bis called the upper limit of the index; 


x, 18 called the summand. 


If the lower limit a and the upper limit 6 are both given as 
fixed integers, then the sum has a definite value within 
the domain of its summands. For instance, if the sum- 
mands are integers, then the sum is an integer. 
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2 
Example 3.1.1: S°k? = 0?+1°+2? = 5. (3.1.2) 
k=0 


Quite commonly, a summation has a lower index limit 
fixed at 0 and a symbolic upper limit of n, in which case 
summation may be regarded as an operator on a sequence 


(Les! | a0 As, Shes) 


whose application produces a sequence of partial sums 


ome | n = 0, ise) 
j=0 


akin to the way that integration operates on a function to 
produce a new function. This chapter develops methods 
for evaluating the summation, which, in this context, 
often means producing a closed formula for the elements 
of the sequence of partial sums. 


Example 3.1.1, cont.: With the variable n as the up- 
per limit, the value of the sum of the form (3.1.2) is 


Sk = 2n° + 3n7 +n 

k=0 6 
This formula could be confirmed immediately by mathe- 
matical induction, or by any of the methods of summation 
to be introduced in subsequent sections of this chapter. 
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Sums over Sets 


In a more general context, the indexing set of a given 
summation may be any finite set 7’. Given any function 
f with values in Z, Q, R, or C, the sum 


yo) 


is well-defined. In a sum over an unordered indexing set, 
the order in which the index variable t takes its values is 
not specified or implied, and the value would be the same 
for any order of summation. 


Example 3.1.2: The sum of the weights of the edges in 
the graph G' of Fig 3.1.1 is represented by the expression 


» w(e) 


eCEg 
whose value is 


OoP (eed oop oor GO OOP O44 E10 6 =. 62 


a 





3 
d 

5 
g 


Fig 3.1.1 An edge-weighted graph. 
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In principle, the edges of the graph G could be indexed 
by integers 0,1,...,11, which would permit the sum of 
their weights to be represented by a consecutive summa- 
tion. ‘There seems to be little gained from doing so in this 
example. Our focus here is to do something more efficient, 
when possible, than successively incrementing a running 
total by additional summands. Such tedium is unavoid- 
able when the summands have no discernable pattern, es- 
pecially if the summands are random numbers. However, 
in many other cases, when the index set is a subset of the 
integers, a transformation may simplify the evaluation. 


DEF: Indexing the summands over consecutive integers is 
called normalizing a summation. 


Example 3.1.3: ‘The sum 


Sk 


eee 
odd 


can be normalized to 
9 
Del (2k+1 
k=0 


which is readily transformed further into 


9 9 9 


S Chel = okt 1 


k=0 k=0 k=0 
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9 
Z k + 10 
k=0 


102 
2° — + 10 (by Corollary 1.5.2) 


102 + 10 
— 100 


Many of the methods to be introduced in this chapter 
are designed to work on normalized summations. Other 
sums are transformed into consecutive sums to permit the 
application of such methods. 


Iverson Truth Function 


When the index variable of a summation has irregular 
gaps in its range, it may still be possible to normalize, by 
inserting into the summand an artificial multiplier that 
effectively cancels the summand across the gaps. 


Example 3.1.4: For instance, the index variable p of 


the sum 
>) VP (3.1.3) 


pon 
p prime 


has gaps between consecutive primes. 


DEF: The Iverson truth function is defined by the rule 


(predicate) = 1 if the predicate is true 
~  |0. if the predicate is false 
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Example 3.1.4, cont.: Using the Iverson truth func- 
tion facilitates the reformulation of (3.1.3) as a consecutive 
summation. 


>» vP = >_ [@ prime): yP] 


pon 
p prime 


CONVENTION: ‘The value of the product 

(P(k)) » ax 
is 0 whenever the value of the Iverson expression (P(k)) 
is 0, even when a, is undefined. 


Example 3.1.5: ‘The value of the sum 


mr 


SS - -(p prime) 


p=0 


is well-defined, since the “strong zero” of the Iverson ex- 
pression (p prime) cancels the effect of the undefined quo- 
tient ; when p= 0. 


Algebraic Regrouping 


Part of the art of simplifying and evaluating sums is 
to manipulate them so that recognizable forms emerge. 
The familiar algebraic properties of the number system 
include several principles for regrouping. These principles 
are applied independently and also in conjunction with 
the other summation methods of this chapter. 
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Proposition 3.1.1 [Distributive Law]. A common fac- 
tor can be distributed over all the summands. 


sae =e Ge 


kek kek 


Proposition 3.1.2 [Addition Law]. Two sums over the 
same index set can be combined into a single sum by 
adding each pair of summands with the same index. 


S- (an + on) = So an+ S_ by 


kek kek kek 


Proposition 3.1.3 [Permutation Law]. The value of 
a sum is unchanged by permuting the order of the sum- 


mands. 
>» an = > On(k) 


kek kek 


As a first illustration, we apply these algebraic re- 
groupings to an arithmetic progression. From our present 
perspective, that means a sequence (Gn) given by a recur- 
rence of the form 


ag = Cc 


An—1 + 6 for n > 0 


Gn 


For instance, the consecutive odd numbers 3, 5,7,9,... are 
an arithmetic progression, with initial value c = 3 and 
increment 6 = 2. 
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Example 3.1.6: Simplifying the sum of a finite arith- 
metic progression 


mr 


Sn = >_ (c+ dk) (3.1.4) 


k=0 
can begin with application of the Permutation Law. 


rr 


Sn = >> (ce +b(n—h)) (3.1.5) 


k=0 


Adding equations (3.1.4) and (3.1.5) leads into the follow- 
ing analysis. 


SY (e+ bk) + So (c+ 6(n—k)) 
> [(c + bk) + (c+ b(n — k))| (Addition Law) 


Se (2c + bn) 


k=0 


254 


(2c + bn) SS 1 (Distributive Law) 
k=0 


(2c + bn)(n + 1) 


> Sn = (c+ =) -(n +1) (3.1.6) 


11 
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Example 3.1.7: This is a special case of formula (3.1.6) 
mr 1 - 
Sik = (0+ *) eT) 
k=0 
n 
= (Grn 
— {ntl 
7 2 
For instance, 
-=() 
2 


Order 3 Paso 


Harmonic Numbers 


REVIEW FROM 81.2 
The sequence of harmonic numbers (H,,) is defined 


by the rule 


1 


forn >0 


i ie 


1 
oie eo 


— Jt 2 


The harmonic numbers are the discrete analogue of the 


natural logarithm 
In(n) = / _ dx 
1 @& 
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Figure 3.1.2 illustrates that the harmonic number and the 
natural logarithm are reasonably good approximations of 
each other. Familiarity with upper and lower Riemann 
sums may add some interest here, but such familiarity is 
not necessary for understanding of the correctness of the 
approximation. 





ie 
_ Zim WILLE. 


x=1 x=4 = 


Fig 3.1.2 Upper and lower Riemann approximations of 4, 


X 


Since the area under the curve 1/z over the interval [1, 5] 
is In(5), one observes that In 5 is less than the sum of the 
areas of the upper rectangles, 1.e., 


1 i. 1 1 
mb< 7+ 5+ gtg = Me upper sum 


and that In5 is greater than the sum of the areas of the 
lower rectangles, i.e., 


H. ese Se aides WN l 
5 Soe ear se n ower sum 


This observation generalizes to the following: 
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Proposition 3.1.4. For any positive integer n 
(a) In(n+1) < H, < In(n) +1 
(b) H, —1 < In(n) < An-i 


Proof: Summing the areas of the upper rectangles (i.e., 
taking upper Riemann sums) yields 





1 1 
In(n+1) < qi ep FH, (3:16) 
ie 
In(n) < ; rr hee te : H (3.1.8) 
n(n wa ree = is zs 
1 a : 


and summing the areas of the lower rectangles (i.e., taking 
lower Riemann sums) yields 


1 1 
Together, (3.1.7) and (3.1.9) imply part (a). Similarly, 
(3.1.9) and (3.1.8) imply part (b). & 


GKP Notations 


DEF: Let n and d be integers. If there is an integer g such 
that n = dq, then we say that d divides n. Notation: 
GT. 


DEF: Let m and n be integers whose greatest common 
divisor is 1. Then we say that m and n are relatively 
prime. Notation m L n. 
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3.2 PERTURBATION 


The initial step of a perturbation is to equate two ex- 
pressions for S;,41, the n+1* partial sum of the sequence 
(tn). 

n+1 


Sn ae ntl = LO - Ek 
k=1 


The sums on both sides of the equal sign are clearly equal. 
Perturbation is a practical method, and additional tricks 
are used as needed. What makes it interesting is not the 
theory behind it, but the fact that it works so effectively 
so often. 


Example 3.2.1: A very simple first example of applying 
perturbation is to evaluate the summation 


ae 32) 
k=0 


Of course, the solution to the summation (3.2.1) is easily 
obtainable by other methods, but the details serve as a 
good illustration of the technique of perturbation. 
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n+1 n+1 
Sy torth = 2°45 72% = 14+5°2* (set up) 


= 1+ s get (change of limits) 
k=0 
= 142 5 oF 
k=0 

= e255 
=> oo Sa (solution) (322) 
For instance, for n = 3, the value of the sum (3.2.1) is 

22 42h 42742? = 1424448 = 15 
and the value of the closed formula (3.2.2) is 
24-1 = 16-1 = 15 


A Classical Example of Perturbation 


Example 3.2.2: <A classic example to show the power 
of the method of perturbation is the sum 


oo Se (3.2.3) 
k=0 


which is not so easily evaluated by the most elementary 
methods. The setup used here (and on Example 3.2.1) is 
characteristic of applications of the perturbation method. 
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n+1 n+1 
Sx t(n+1j2"t* = 0-29 + 5 k2* = So k2* (set up) 
= »S Keo (change of limits) 


k=0 


= ys kore yo (Addition Law) 
k=0 k=0 


= 2 Se bor 9 ». a (Distributive Law) 


= 25, + 2(2"t* —1) (from Example 3.2.1) 
= Sp = (n+1)2"t" —2(2"t7? — 1) 
= eto (solution) (3.2.4) 


For n = 3, the result of the term-by-term summation 


(3.23) 
3 
y_ ka? Sep ao fi aor ee se 08 
k=0 


= 04248424 = 34 


agrees with the evaluation of the formula (3.2.4) derived 
by perturbation. 


(n—1)2"t* = (3—1)2*+2 
= 2-16+2 = 3242 = 34 
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Indirect Perturbation 


When a first perturbation misses the target, it may 
help to adjust what is to be perturbed and to try a second 
time, as indicated by the next example. 


Example 3.2.3: We evaluate the sum 


Se = ye (3.2.5) 
k=0 
by perturbation, as in previous examples. 
n+1 n+1 
Sut Hner = Ho + D/H = 0+ DMs 
mr mr 1 
Yo Hen = (e+ 7) 


Tr Tr 1 
> oe DS pa 
k=0 k=0 


=. a 
Pt Day 


—~ 1 
k=0 


Formula (3.2.6) is quite correct, but it is not what was 
wanted, since the symbol 5, cancelled out. When this 
occurs, a standard maneuver is to multiply the summand 
by the index variable & and to perturb the result. 
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Example 3.2.3, cont.: Multiplying the summand HA; 
by the index variable in this example yields the summation 


= Ski, (3.27) 
k=0 


which we now perturb, as follows. 


n+1 
Sa t(n+)Hna1 = 0Ho + Rik 
k=1 
n+1 
QO + S_ kit 
k=1 


> + DA 


k=0 


= S(k+1) (14 - oa) 


k=0 


mr 


k+1 
= es ite. et 


= Soh + SH + yn 


= S, + SH, + ntl 
k=0 


=> SJ Ay = (n+VAngi—(nt1) (3.2.8) 
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This time, the result is a formula (3.2.8) for the sum of 
consecutive harmonic numbers, the formula we actually 
want. For n = 3, directly adding the harmonic numbers, 
which are the summands of the sum (3.2.5) 


Son, = o+(t)+(te2)e(tatet) = 8 
7 1 i 2 P2737 = 3 
k=0 

and applying the summation formula (3.2.8) 


25 25 13 
4H,—A4 — AN spp a SA — rel — => 
12 3 3 


yield the same result, thereby illustrating correctness of 
the formula. 


As a second example of indirect perturbation, con- 
sider the problem of deriving a formula for summing k?. 


Example 3.2.4: ‘To evaluate the sum 
Sic Ss ke (3.2.9) 
k=0 


we start as usual. 


n+1 n+1 
Snt(n41P = 0 +5°R = SOR 
k=1 k=1 
= Si (k+1? = 5 (kh? 4+2k+1) 
k=0 k=0 
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— Sk + Sk + 1 (Addition Law) 
k=0 k=0 k=0 


oe hk? +2 SS k+ > 1 (Distributive Law) 
k=0 k=0 k=0 


= S, +2) 0k + (nt+1) 
k=0 


= SS eee) = eee (3.2.10) 


Thus, as in Example 3.2.3, direct perturbation has yielded 
a correct equation that is not the desired result. Seeking 
to remedy this situation, we once again multiply the sum- 
mand by the index variable and re-perturb. 


Example 3.2.4, cont.: Since perturbing the sum of 
consecutive values of k? just above has yielded an eval- 
uation for the sum of consecutive values of k, it may be 
less than fully surprising that perturbing the sum of values 
of k® yields a formula for the sum of values of k?. This 
time, set 


See ye (3.211) 
k=0 


Then 


n+1 n+1 
Sot (nti1p =O +5 0 = Sok 


Section 3.3 Summing with OGF’s 21 


rr 


So (k+1)° 


= > (hk + 3k? + 3k +1) 
k=0 
=S°R435°h 435 k4+ 501 
k=0 k=0 k=0 k=0 
=6,+3) Rh +3 k+ 501 
k=0 k=0 k=0 


= 3S 0k = (n+1)° LS 
k=0 k=0 k=0 


3n? + 3n 


= (n+1)° 5 = A) 
_ yes 3n? +5n+2 _ 2n? + 3n7 +n 
7 > = 2 
. In? + 3n7 +n 
bk? = 3210 
=> d ; (3.2.12) 


For n = 3, the value of the sum (3.2.9) 
3 
Soe = FP 42743? 
k=0 
= 04+1+4+419 = 14 


agrees with the value of formula (3.2.12) 


2n? + 3n7 +n _ Pe Bota Oo ae _ 84 = ih 
6 BOG 
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3.3. SUMMING WITH OGF’S 


In this section, it will be seen that most of the sums 
evaluated in §3.2 could easily be evaluated, alternatively, 
by using OGIF's, as indicated by Theorem 1.7.4 and its 
corollaries, with the aid of partial fractions, as needed. 


REVIEW FROM 81.7: 


e Theorem 1.7.4. Let B(z) be the OGF for a sequence 
(b,,). Then the OGF for the sequence of partial sums 
of (b,,) is 








B(z) 
L= 2 
1 = n+r—1 
Cor 1.7.5. = a 
2s <> py — Je 
1 = n+r—1 
Cor 1.7.6. ——— = ie” 
rere (1 — az)" > ( r—1 Jars 


Revisiting Examples 


Example 3.2.1, revisited: We examine how to use gen- 
erating functions to rederive the summation formula 


ST 2h = Oa 
k=0 


Section 3.3. Summing with OGF’s 23 
for the powers of 2. As first mentioned in 81.7, the OGF 
for the sequence (bj, = 2”) is 


1 
]-= 22 


By Thm 1.7.4, the OGF for the sequence 


(% = a = Sth n=O) 
k=0 k=0 





Biz) = 





is 





UZ): = dune” = — Ble) 
~ = 1 
~ (l—2z)(1-2z) 


By the method of partial fractions (described in §2.3), 
which here involves the solution of a pair of simultane- 
ous linear equations, it follows that 


oo el 
(l—z)(l-2z)  (1-z) (1-22) 





OO 


So (-1)2” + s Ie 
n=0 


n=O 
oO 


= Ore) 


n=O 


Gg = OS 
k=0 
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Example 3.2.2, revisited: We now rederive the sum- 
mation formula 


Sk = (n—1)2"t1 + 2 
k=0 


Corollary 1.7.6 provides the formula 
1 SOP SY, ens 
(1 — az)" => r—1 Jars 


into which the substitutions a = 2 and r = 2 yield 


(1 — 22)? ->( 1 je = ee 
from which it follows that 
(1—2z)2 — Duta tajartiert® = Snare 


Thus, the OGF for the sequence (6,, = n2”) is 


UZ 


ou) Geog? 


By Theorem 1.7.4, the OGF for its sequence 


(vs =: ke 10h.) 
k=0 
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of partial sums is 





Vig) = oe : B(z) 


n=0 

i oe 

~ (L—2z)(1 — 22)? 
By partial fractions, which this time requires the solution 
of three simultaneous linear equations, we have 

22 _ 2 i 8z 2 
(l—z)(1-—2z)? (1-2) (1 — 2z)? (1 — 22)? 

Thus, by Corollaries 1.7.5 and 1.7.6, it follows that 


tn = > k2h = 24 4n-2" — (n+ 1)artt 
k=0 


= (n—1)2"*' 42 


For this example, the previous evaluation by perturbation 
may seem less effort than the method of generating func- 
tions, because of the linear equations and the care needed 
to apply Corollary 1.7.6 accurately. 


Example 3.2.4, revisited: To rederive the summation 
formula 


sR _ 2n° + 3n2 +n 
k=0 : 


the method of generating functions is easier than pertur- 
bation, since it avoids the false start encountered in per- 
turbation, which is unlikely to be discovered until the late 


26 Chapter 3 Evaluating Sums 


stages. To derive the OGF for the sequence (by = k?) we 
start with Corollary 1.7.5. 


1 —(ntr-1 és 

(l-z)r > ( r—1 ): 
n=0 

For r = 3, this yields 


qa & (2) 








n=0 
Therefore, 
a ~. (3) : “nan ,, 
= Zz = az 
c= = BG) = 2s 
and 








5; 7 Cy) - - tte 


— n=O 


from which it follows that 
ge 
Tae => nz" = B(2) 
By Theorem 1.7.4, the OGF for the sequence 


(% = se 00h) 
k=0 








is 
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Corollary 1.7.5 with r = 4 is 


a7 a 











n=0 
‘Thus 
ges - n+3 n+3 
Y(z) = = ( )e n+2 ( ) n+1 
a=aF = 2s Xs 
(ye + je = ee 
= zi 
n=O 3 n=O 3 
Therefore, 
n+1)32 2\3 
= MEE, (n+2) 
6 6 
= ne—n n°? + 3n? + 2n 
«6b 6 
a Qn? + 3n7 +n 


6 
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3.4 FINITE CALCULUS 


In the Fundamental Theorem of Finite Calculus (The- 
orem 1.4.3), Part (a) asserts how sums can be evaluated as 
anti-differences, analogous to way the fundamental theo- 
rem of infinitessimal calculus asserts that integrals can be 
evaluated as anti-derivatives. 


REVIEW FROM 81.4: 


e Given a function f : R > R, the difference function 
Af is given by the rule 


Af(e) = f(e+1) — fle) (3.4.1) 


e Given a sequence (z,,), we define the difference se- 
quence (Az,,) by the rule 


/ 
NG SE. SS GS 


e Theorem 1.4.3 [Fundamental Theorem of Fi- 
nite Calculus]. Let (z,,) be any standard sequence. 
Then 


n—1 k-1 
(a) oe = By = 2G; (b) ee = By 
j=0 j=0 = 
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Summing a Polynomial 


We recall that the finite calculus formulas for differ- 
encing and summing a falling power are directly analogous 
to the infinitessimal calculus formula for differentiating 
and integrating an ordinary power. 


REVIEW FROM 81.5: 


e The n“ falling power function on a real variable z, 
for any n EN, is defined by the rule 


n factors 


a 
ct = a(a—1)---(@-—n+1) 


e Thm 1.5.1. For r € Z*, we have A(#2) = ret. 








= nrth 
e Cor 1.5.2. For r € N, we have > ko = 
ram r+l 


REVIEW FROM 81.6: 


e Thm 1.6.1 Any ordinary power x” equals a linear 
combination of falling powers, i.e., in the form 


Tr 
a ) Pee 
r=0 


where the coefficients S,,,, are called Sterling numbers 
of the second kind. 
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We will now use the reviewed results to see how finite 
calculus makes many kinds of summation quite routine. 


Example 3.2.4, revisited again: In this chapter, we 
have already derived the summation formula 


se z 2n° + 3n7 +n 
k=0 2 


first using indirect perturbation, and then again with gen- 
erating functions. For this derivation, summation calculus 
is easier yet. With finite calculus, we first express k? as 
a linear combination of falling powers. For monomials of 
low degree, it is easy enough to calculate the coefficients 
of the falling powers by ad hoc methods. 


k? Sx2k% + Sy1 k* 


= k* + kt (34.2) 


Summing both sides of equation (3.4.2), we obtain 


mr 


sk = > (kh + ke) 


k=0 


= STR + Soa 
k=0 k=0 
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Applying Corollary 1.5.2 now yields 














ye ae ee ae 
k=0 3 k=0 2 k=0 
nae > .. ee d)e 
7 3 2 
Pore No 2 
7 i nmtn 
3 Z 
= In? + 3n7 +n 
= — 


Formula for Summing Exponentials 


The supply of useful finite summation formulas is 
readily extended beyond the monomial formula of Corol- 
lary 1.5.2. ‘This begins with sums and differences of expo- 
nentiations in which the base is constant and the exponent 
is variable. 


Theorem 3.4.1. Let the constant value c be a real num- 
ber and let x be a real or integer variable. Then 


Ac” = (c-1)c” 
Proof: Ac® = c*tl-~c® = (c—-l1)c*. » 
Example 3.4.1: For the case c = 2, ‘Theorem 3.4.1 gives 


the result 
BOS > AZ aS 
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which is analogous to the differential-calculus result 


d 


Xv a“ 
—e 
dx 


= 
This is one of numerous reasons why the number 2 is re- 
garded as the natural base of discrete mathematics in the 
same sense that the real number e is the natural base for 


continuous mathematics. More generally, the continuous- 
mathematics formula 


d Xv l XL 
—c" = lInc-e 
dx 


is analogous to the discrete-mathematics formula of The- 


orem 3.4.1. 
IC? 2 O= e* 


Example 3.4.2 A3" = 3°t! — 3 2-3" 
A4® = 4*tt_4® = 3.4* 


This leads to a major formula of the finite-summation cal- 
culus, the formula for summing exponentials. 


Corollary 3.4.2 [Exponential Formula]. Let c be any 
real number except 1. Then 


k CS 
= 3.4.3 
De eg eS) 
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Proof: Forc #1, applying the Fundamental Theorem of 
Finite Calculus to the conclusion of Theorem 3.4.1 implies 
that 








n—1 k n 
a2 
k=0 Cm k=0 
ec” —] 
e-—l ° 
Remark: For the case c = 1, which is excluded from 


Corollary 3.4.2, we have the sum 
n—1 
Sih = 
k=0 


Example 3.4.3: We observe that when c = 5 andn = 4, 
the value of the sum on the left side of equation (3.4.3) 


3 
ye SS BO se i PS el a On. Ss 456 
k=0 


agrees with the value of the quotient on the right side 


a 625 —1 
= = 156 
4 4 








As easy as it was to derive the formula 


ST 2h Se 1 
k=0 
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either with perturbation or with generating functions, it is 
even easier with the calculus of summation, as now shown. 


Example 3.2.1, revisited again: According to Theo- 
rem 3.4.1, we have 


Ag® = 2* 


Summing both sides, we obtain 
Sra = yaa 


after which, applying the Fundamental Theorem yields the 





result 
mm 
k=0 vs 
Se = i 


Falling Negative Powers 


The extension of the list of useful differencing and 
summation formulas continues. We observe that non- 
negative falling powers can be defined recursively. 


ge = 1 


gttt = g"(¢—r) for r > 0 


Running the recursion backward extends the utility of the 
falling power concept. 
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DEF: Non-positive falling powers are defined as fol- 
lows. 





| 

a girth 

s- = for r <0 
L—rT 


Example 3.4.4: Here are a few evaluations of the defi- 
nition of negative falling powers. 


0 0 











24 x= x= 1 
—— — re — — 
x — (-1) z+1 z+1 
pe x _ — _ 1 
z—(-2) «+2 (e+1)(e+2) 
a8 oe t= 1 
ee dx = = 


— @—(-3) «+3 (a@+1)(4+2)(4 +3) 
Proposition 3.4.3. For any positive number r and any 
real number «, 


——— 
(a +1)---(2@+r) 


Proof: By induction on r. o; 


Although ordinary powers are additive in a product 
of ordinary monomials with the same base, in the sense 
that 
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it is clear that falling powers are not additive in a product 
of falling-power monomials. For instance, 


a*-¢3 = a(x —1)- a(x —1)(x —2) 
but 
g2tS = g2 = x(a —1)(x —2)(x — 3)(x —4) 


Thus, there is no reason to expect that e—" = (a£)7!. 
On the other hand, an important analogy to infinitessimal 
calculus is preserved. 


Proposition 3.4.4. The difference formula for negative 
falling powers is the same formula as for positive falling 
powers. That is, for every positive integer r, 


hw = (=p) ga 


Proof: Start by applying the defining formula (3.4.1) for 
the difference operator. 


De = (ea) = a 

_ 1 1 

~ (@+2)--(a@trt1) (@+1)--(a+r) 
Then by routine manipulation 

1 1 1 

— (@+2)---(2@+r) |_—_ - | 

1 —T 
(o+2)---(e+r7r) [es 
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“Cenc aD 


we achieve the result 


hg (apg ® 


Cor 3.4.5. For any integer r # 0 and any real number z, 


Agt = rgta! 





Proof: This combines Thm 1.5.1 and Prop 3.4.4. o; 


Cor 3.4.6 [Monomial Formula]. For integers r #4 —1 
and n, 





n—-Ll peek n 
Sk = (3.4.4) 
k=0 r+ lpzo 
Proof: ‘This combines the Fundamental Theorem and 
Corollary 3.4.5. os 


Example 3.4.5: ‘To make a direct evaluation of the left 
side of Equation (3.4.4) for r = —2,-3 and n = 4, we first 
calculate the following partial table of the values of k=, 
i.e., of a small integer to a small falling negative power. 
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Case r = —2 and n=4: The value on the left side is 

1 1 1 i 
io 205." Bel Go 
and the value on the right side is 
4 


(me oe a 

















ko _ = © 1 Bes ein, © 
=a = —1 (-1)-5 -1 5 =? <6 

Case r = —3 and n=4: The value on the left side is 

a a ee es 
1 2 i ‘ 1 i: i 7 
1-2-3 Ds3.4.° (334.5.  4.5%6. ~~ 80 

and the value on the right side is 
kar ae ee 1 1 8 
—2|,.5 —-2 —-2 (-2)-5-6 (-2)-1-2 ~~ 30 





Harmonic Numbers 


Another analogy between the natural logarithm Inn 
and the harmonic number A, lies in the similarity of the 





derivative 
oF Ing = x? 
to the difference formula 
Ai, = n+1 — Hy, 
Sipe af a : : = =a : 
ane Aad 1 n 
1 =i 
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and, naturally enough, in the similarity of the summation 
formula 


nm—1l 
Sok = An (3.4.5) 
k=0 


to the integration formula 


t 
/ a 'dxe = Inz 
r=1 


Product Formula 
Analogous to the product formula for derivatives, 
(u(x) + v(a))" = w(x) -v(a) + ula) -v'(x) 
there is a product formula for finite differences. 
Prop 3.4.7 [Product Formula]. Let h(x) = g(x)-f(a). 
Then 
Ah(x) = Ag(x)-f(«+1) + g(z)-Af(x) (3.4.6) 


Proof: Once again, it is sufficient to do some routine 
algebraic manipulation, starting from an application of 
the definition of the difference operator. 


Ah(a) h{a+1) — h(x) 
gle +1)-f(e@+1) — g(x): f(z) 
= glet+1)-fle+1) — g(x): Fle) 
— g(x) F(@ +1) + glx): fle +1) 
Ag(x): fle +1) + g(a): Aflz) % 
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Example 3.4.6: Take g(n) = n? and f(n) = Hy. Ac- 
cording to the product formula (3.4.6), we have 


A(n2 Hy) = An®- Anyi + n*-AHn 








1 
= Die. —]1)-——_ 
n- Angi + n(n a 
1 n2=—n 
= 27 | Aye —_—_— 
n( +s) + n+l 
netn 
= On. 
- z n+l 
= 2h 


Unsurprisingly, evaluating the defining formula (3.4.1) for 
a finite difference yields the identical result. 


A(n@H,) = (n+1)* Anyi — n2 Hy 


(n° +n) (un ~ —) — (rn? —n)H, 





n+l 
n2+n 
= (n? +n)H, + mae = (n? —n)Hy, 
= 2nH, +n 


Summation by Parts 


From the infinitessimal calculus, we recall the follow- 
ing formula for integration by parts 


| ‘ule)e"(ode = ule)o(@)]’ [ eee 


The finite calculus has an analogous formula, called sum- 
mation by parts. 
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Proposition 3.4.8 [Summation by Parts]. Let g(k) 
and f(k) be functions on the integers. Then 





Ss a(K)AF(R)) = al — ST ACoA) F(E+1) 
= = (3.4.7) 


Proof: This corollary to Proposition 3.4.7 follows from 
the Fundamental Theorem of Finite Calculus. © 


Example 3.2.2, revisited again: After using the sub- 
stitutions 


g(k) = ke and — f(k) = 2° 


summing the sequence (k2” | k = 0, 1, 2, ...) by parts 
takes the form 


s k2* = s ko" 
k=0 k=0 


| 
a 
ke 
No 
ca 
ae 
a 
© 
No 
os 
+ 
ke 


which leads to the calculations 
= (n+1)-2°7* - 2m 


= (n+1)-2""! - ior —1) 
= Sager ® 
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Example 3.2.3, revisited: Since integration by parts 
is helpful in evaluating the integral of Inz to «lng — az, 
it is unsurprising that summation by parts is helpful in 
summing H, tonH, —n. 


n—1 n—1 
Se = > OR, 
k=0 k=0 





n—1 n—1 
nr 1 nr 
= kU SEN ech Me ee GU se 
"lo ei eer “lo Dy 
= (nH, —0)-—n 
= nis =n 


Table 3.4.1 Formulas for the calculus of differences. 








function difference function 
ke rktat 
cf (c—1)c* 
1 
Hy, 
n+l 


Gk) f(k) | Ag(#) f@ +1) + g(a) Af (a) 
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Table 3.4.2 Formulas for finite summations. 


summation formula reference 














k c* —] 
24 Cn Sas 0 ai (3.4.3) 
n—1 rack 
ke, or Al (3.4.4) 
r+l 
ay nm—1l 
Sok ae (3.4.5) 
a k=0 
Yo atk) A (F(4)) alk) F(R)] 
k=0 


— S/A(g(k)) f(k+1) (3.4.7) 
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3.5 ITERATED SUMS 


In the simplest case of iterated summation the index 
set U 
1eU 
is a 2-dimensional array, such that one could first take the 
row sums and then add those sums to get the total. Some- 
times the first summation, called the inner summation, is 
for groupings other than rows. Sometimes, when given a 


double summation to evaluate, it is helpful to swap the 
order of summation. 


Double summation need not be twice as hard. Indeed, 
sometimes a single sum is recast as a double sum to make 
it more tractable. A possible strategy in evaluating a dif- 


ficult sum 
1eU 
is to find a partition 


Ue Us 


kek 


such that each of the sub-sums 


1eU;, 
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is tractable, and also such that the sum 


Dh = Do 


kek kek 1€CU, 


of the sub-sums is tractable. 


Independent Indices 


An example from graph theory illustrates the simplest 
case of a double summation, in which the index of the inner 
sum is independent of the index of the outer sum. 


Example 3.5.1: The degree of a vertex v of a graph 
is the total number of edge-incidences on v. Each edge 
e contributes 0, 1, or 2 to that total, corresponding to 
the number I(v, e) of times that vertex v is an endpoint of 
edge e. Figure 3.5.1 shows a graph, with its vertex degrees 
as bold numbers. 





Fig 3.5.1 Degrees of the vertices of a graph. 
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Thus, the sum of all the edge-vertex incidences 


ys I(v,e) 


(vie) EVXE 


is indexed by the cartesian product V x &, where V is 
the set of vertices and EF the set of edges. The obvious 
partition of this sum over a cartesian product of two sets 
is into an iterated sum 


>, 10,6) = S_ dS Iv, e) 


(vie)EVXE veV ecE 


over the incidence matrix, with rows labeled by vertices 
and columns by edges, so that the row-sums are the de- 
grees. 


s t uv ov 2“ y z degree 
a 22 10 0 0 0 1 6 
6 0 0 1 1 1 1 0 0 4 
c 00 00 00 1 1 Z 
d 00 01 1 1 41 £40 4 


Of course, the result of adding the row-sums of an array 
equals the result of adding the column-sums. In this case, 
since every column-sum is 2, adding the column-sums is 
equivalent to doubling the number of edges, which is faster 
than adding degree-sums. This observation yields an al- 
ternative proof of Euler’s Degree-Sum Theorem (Theorem 
0.6.1). 
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Thm 3.5.1 [Euler’s Degree-Sum Thm]. The sum of 
the degrees of the vertices of a graph equals twice the 
number of edges. 


Proof: Let V = (V,F) bea graph. Then starting from 
row sums 


» deg(v) = Se ye I(v,e) sum of row sums 
vEV veV eCE 


swap the order of summation: 


= > yi I(v,e) sum of column sums 


eCHk veEV 

— > 2 every column sum is 2 
eCH 

= |b ° 


Interchanging the order of summation is a fundamen- 
tal technique for evaluating an iterated sum over an array. 
It is useful when the implicit repartitioning yields inner 
sums and an outer sum for which the total effort of eval- 
uation is less than that for the given iterated summation 
problem. 


In this instance, the cost of repartitioning was trivial, 
because the index of the inner sum of the given iterated 
sum was independent of the index of the outer sum. 





: 


nl 
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Dependent Indices 


If the limits of the index of the inner sum are inde- 
pendent of the index of the outer sum, then the order of 
summation can be transposed without changing the limits 
of either index. However, it is quite common for the outer 
index to range from 1 to n, while the upper limit of the 
inner index equals the outer index. As illustrated in Fig- 
ure 3.5.2, this amounts to summing over the rows of the 
lower-left triangle of an n x n array. 


row sums 





5,5 D 


n,3 ries nn 


Fig 3.5.2 Row sums and column sums. 


Interchanging the order of summation for this form of it- 
erated sum turns the outer sums into column sum. The 
new inner index (the row index) has the outer index (the 
column index) as its lower limit and ranges up to n. 


Example 3.2.3, revisited: The sum of the harmonic 
numbers has previously been evaluated by perturbation 
and by finite calculus. Another effective method is go- 
ing to a double sum and then interchanging the order of 
summation. 
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i 
ke 


write as double sum 


ae 
>. | Fe 


re Fr 


n—1 
YH = 
k=0 


vail 
an) 
So 
I| 


3 
| 


swap order of summation 


| 
i 


ca 
II 
SS, 


3s fM 
a 
= 

| 

ke 


factor out constant 


| 
S2. | Fe 
ee 


| 
re Fe 
a 
SS. 


BOG 
| 


evaluate inner sum 


| 
M4 
>. [| Fe 
= 
S 


SB &. 
| 
re Ff 


71 
— Ss add zero 
= i > = n = 1 
= nil. = a 


In circumstances when swapping rows and columns of 
an array does not adequately reduce the evaluation, it may 
help to reorganize the partitioning of summation so that 
the inner sum is over some tractable geometric pattern 
other than a row or column. 
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Example 3.5.2: ‘Io envision how to repartition the dou- 


ble sum 
nm k—-l 


as an aid in evaluation, it helps to write out the array of 
summands, like this. 





2 | 3 
e || 4 
1 | 3 
5 | 3 


Evidently, summing rows or columns amounts to summing 
the harmonic numbers. However, the strategy of summing 
on the southeastward diagonals (in which the entries are 
constant) yields the following result, which is consistent 
with Example 3.2.3, which also sums harmonic numbers. 


wl NlF FR 


NI Fle 
r|rR 


3 

3 
| 

Q 





Q 

I| 

ee 

— 

I| 

eH 
Qu) re 
ir 
e 
Qu 
| 

e 
Qu 
ir 

e 
Qu 


= 
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Linear Partitioning: Floor Sums 


Sometimes a sequence of less tractable summands can 
be partitioned into consecutive finite subsequences with 
tractable sums. In particular, this may occur when the 
summands are the floors or ceilings of a non-decreasing 
sequence. 


Example 3.5.3: Seeking to evaluate a sum of floors may 
suggest resorting to an approximation, such as 


Y [ve] ~ Dove 


k=0 
nr 
i ot! da 
x=0 


mr 


2 


2 


2 
& 8/2 





x=0 


2 
ese 
3 


For n = 9, the value of this approximation is 


2 
73/2 


2 2 





n=9 


whereas the exact value is 
[vo] + | va] + |v2] + [v5] 
+ [v4] + [v5] + [v6] + [v7] + |v8] + |v 
= 0 Ae lee Se ee 2 2 Ses 2a = 16 
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Sometimes, an approximation this rough meets the 
purpose at hand. However, it is helpful to be in command 
of methods that get an exact value when it is needed. 
There are five steps in the derivation of an exact eval- 
uation formula for such a sum by the method of linear 
partitioning. 


Step 1. List the early terms of the sequence, 
and partition them according to the value of the 
summand. 


Step 2. Express the size of all but the last cell. 


Step 3. Express the size of the last cell, which 
needs individual attention, since its size might 
not follow the same rule as the other cells. 


Step 4. Evaluate the given sum. 


Step 5. Confirm for a small case. 


We now demonstrate the application of this method to 
Example 3.5.3. 
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Example 3.5.3, cont.: As the index k of the sum 

>~ | ve] 

k=0 
increases, the value of the summand |[k] increases also, 
but more slowly than the index itself. 


Step 1 is to partition the index values according to the 
value of the summand. The partition for k = 0,1,...17 is 
as follows: 


Table 3.5.1 Partitioning for the summand |k|. 


1 3 o 7 


LO aaa ——— ee 
k 0 123 45678 9101112131415 1617 
Ace! Or Rit 22299 BB. 38 8 Bk. wl A 


Step 2 is to express the sizes of all but the last cell. Each 
other cell in Table 3.5.1 is grouped with an overbrace, with 
its size written over the overbrace. The smallest number 
within each cell is the square m” of some number m. Since 
the number (m+ 1)? starts the next cell, it follows that 
the cell containing m? is 


ce iP el weap teg m* + 2m} 
Evidently, 


eae ee m> + 2m} = 2m+1 


x 


k=0 
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Step 3 is to express the size of the last cell 


{| Vn |’, eas n} 


whose entries correspond to the uppermost summand |/n |.§ 
Its size is 


#{LJ/n]?, sete n} =n—|Vn|?+1 


Step 4. To evaluate the given sum, we multiply each of 
the realized values of the summand by the corresponding 
number of values of the index k and then sum the prod- 
ucts. 


[Va |-1 
VE] = So Qm+i)-m + (n- [vn]? +1)-|val 
LVvaj-1 
= DE Q@mt+amt) + (n—Lvn}? +1) -|va 
me 3m2\|Lv" , 
= (FES), + oval +) va 


= an owe + (n-|va]’ +1) -Lve 


Step 5. We confirm for the small case n = 11. 
Sum values in Step 1: 


Ors ee eee ee 2 2 ee — 22 
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Compare with the value by formula of Step 4. 


So (1 - [vin] +1) - [vi 


= 44943-3 = 22 


Example 3.5.4: We now use linear partitioning to eval- 


uate the sum . 
S_ Lg &! 
k=1 


Step 1. List the early terms of the sequence, and partition 
them according to the value of the summand. 


1 2 4 8 16 
ean ees eer 
k; 1 234567 8---15 16--- 31 3233--- 


lIg¢k} O 1122223---3 4.+--4 5 5. 


Step 2. ‘To express the size of all but the last cell, we 
observe that the cell corresponding to the summand m is 


ee ls tia ay 
Its size is 
Pe td ay Sd) 
Step 3. The last cell is 


j2uend, plerd 4a, n 


mr 


Ds 


k=1 
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Its size is 
Quer ae 


Step 4. Evaluate the given sum, using the previously de- 
rived formula (e.g., see Example 3.2.2) for summing k- 2", 


[lg n|—1 
Se m-2™ + [Ign| (n pues 1) 
mal 


[Ig 
= ([Ign]—2)-2le"l + 24 [Ign] (n- 20a! +1) 


Step 5. Confirm for the small case n = 9. 
Sum the values in Step 1: 0+1+1+2+2+2+2+4+3+43 = 16. 
Compare with the value given by the formula of Step 4. 


(\lg9] —2)-2Ue9 4 2 + [Igo] (n — leo +1) 
= (3-2)-2° + 24 3(9-2°+1) 
= 1s 4 2 PB a2 S16 


Remark: The two evaluations just considered have an 
easy second step, because within each group the value of 
the summand is constant. If the summand were k| Vk], 
for instance, then an inner sum might be introduced in 
Step 2 for the partial sum over the interval corresponding 
to a group. 
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3.6 INCLUSION-EXCLUSION 


Sometimes, the index set for a complicated sum has 
subsets with tractable sums, but those subsets overlap. 
The strategic insight of the inclusion-exclusion method is 
that the partial sums over those subsets can be combined 
into a total sum by subtracting the overcounts. 


Venn Diagrams for Overlapping Subsets 


The simplest case has two overlapping subsets, A and 
B, asin Figure 3.6.1. ‘The domain from which both subsets 
are drawn is denoted U. 


U 


Fig 3.6.1 Venn diagram for two overlapping subsets. 


Suppose that the objective is to calculate the sum Syyp 
over the set AU B, with partial sums S4, Sg, and Saqp 
over subsets A, B, and Syanp, respectively. Then 


Saup = Sa+Sp-—Sane 
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Example 3.6.1: The number of integers in the range 
1,...,n that are divisible either by 2 or by 3 is expressible 
as a consecutive sum with indexing in the integer interval 
[1 : n] and the summand 


f(k) = { 1 if nis divisible either by 2 or by 3 


0 otherwise 
that is, by the sum 


mr 


S(2\k V 3\k) 


k=1 


Every number that contributes 1 to this sum lies either in 
the set {k € [1 : n] | 2\k}, with cardinality [n/2], or in 
the set {k € [1 : n] | 3\k}, with cardinality |n/3|. Adding 
these two cardinalities overcounts by |n/6], the number 
of integers in [1 : n] that are divisible both by 2 and by 3. 
Thus, 


mr 


S_(2\k V ohh) = S| sp: =| = H 
Sell 
In Figure 3.6.1, 


A = {ké[L:n] | 2\k} 


and 


B {k €[1:n] | 3\k} 


Their intersection is 
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ANB {k €[1:n] | 6\k} 


and 


Ui = Lg) 


In Example 3.6.1, the implicit summand is the num- 
ber 1, since we counted the number of elements in a set. 


That is, 


Sx yl Ss ee 


hex 
for X = A, B, ANB, or AUB 


Example 3.6.2: <A related problem is to calculate the 
sum of the numbers that are divisible by 2 or 3. Then, 
instead of having a constant value of 1, the value of the 
summand equals the index itself. That is, 


Se = Sh 


hex 
for X = A, B,ANB, or AUB 


[n/2] [n/2| 5 
Sa = Ske yee fog 


2 
2k a 


Similarly, 
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So = he = 3 VSP +le/3I 


2 
3\k<n 
and 
[n/6]° + |n/6] 
Sang = Dok = see 
6\k<n 
Therefore, 


Savie = oat Se GARB 


= igh i ie n/t [e/3) = ¢ eee [n/6] 


For the small case n = 14, direct addition and the formula 
both yield Saug = 68. 


Venn Diagrams for 3 or More Subsets 


Venn diagrams are quite commonly drawn for three 
overlapping subsets, and they have this general definition. 


DEF: A family of n simple closed curves (typically circles 
or ellipses) in the plane, whose interior regions represent 
some subsets A,, Ao,..., An of aset A within a domain U, 
is called a Venn diagram, after the logician John Venn 
(1834-1923). 


TERMINOLOGY: ‘The domain U from which both the sub- 
sets A and B are drawn is commonly called the universal 
set. 
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Example 3.6.3: Eurasian Translators, Inc., employs 15 
expert linguists fluent in at least two of the three languages 
Armenian, Bulgarian, and Czech. Of these, 


Sane = 5 speak Armenian and Bulgarian 
Sanc = 7 speak Armenian and Czech 
and Spac = 9 speak Bulgarian and Czech 


How many speak all three languages? Figure 3.6.2 is the 
relevant Venn diagram. 


U 


7S 
KZ 
= 


Fig 3.6.2 Venn diagram for three overlapping subsets. 


Whereas 15 is the given number of linguists fluent in two 
or more of the three languages, the sum 


SanpB+Sanct+SsBac = 9+74+9 = 21 


of the numbers corresponding to the three intersection- 
regions for which data are given triple-counts the contri- 
bution S4qpenac in the triple intersection at the center of 
the diagram and counts all the other translators only once. 
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Thus, subtracting 15, thereby excluding the total number 
of translators who speak at least two of the languages by 
the calculation 


2 SAABNC — 21—15 = 6 
yields the result 
2S5ansnc = 6 


from which one concludes that 
Sansnc = 3 


It is helpful to check such a result by inserting numbers 
into the regions of the Venn diagram. In this case, the 
number 3 is inserted into the centermost region, represent- 
ing the population of the region AN BNC in Figure 3.6.3. 
Then it must be excluded from the populations given for 
composite regions AN B, ANC, and BNC, in order to 
obtain populations for the simple regions they contain. 


NOTATION: ‘The complement of a set X in a domain U is 
denoted X. 


PA 
Sy 
OS® 


Fig 3.6.3 Inserting numbers into regions of a Venn diagram. 
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We observe that all of the inserted numbers are consistent 
with the original data as well as with the derived popula- 


tion of AN BNC. 


Sanp = 5 = 2+3 = S,,p,q+ Sansnec 


Sanc = 
and Spac 


44+3 = S$, 550+ Sansne 


eo = SAABNC + SAnBnc 


Context for Inclusion-Exclusion 


A more general context of inclusion-exclusion evalu- 
ations is a domain U, a real-valued function f : U - R, 
and a representation 


A= UJ Ak 
k=1 


of set A as a union of subsets A;,, regarded like a Venn 
diagram with n mutually intersecting subsets. (Some of 
the regions may be empty.) 


Remark: Quite often, the function f : U — R is the 
constant function f(z) = 1, in which case the evaluation 
amounts to calculating the cardinality of a region. 


NOTATION: The intersection of two sets A; and A; may be 
denoted by the juxtaposition A;Aj;. 


DEF: An intersection A;, A;, --- Aj, 1s called an r-fold in- 
tersection of the family {A;}. 
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Formulas for Inclusion-Exclusion 


As illustrated by Example 3.6.3, evaluating sums over 
combinations of regions in Venn diagrams takes some care. 
Fortunately, such evaluations can usually be reduced to 
the application of two or three general inclusion-exclusion 
equations for sums over single regions. 


Thm 3.6.1 [Exclude-All Equation for Set Size]. Let 
Aj,...,An be subsets of a set U, with 


A= LJ Au and S$, = ye [Aycorn Ay | 
k=1 


11 y.-ytrE[1in] 


so that, forr =1,...,n, the number S,. is the sum of the 
cardinalities of all r-fold intersections of the family {A,}. 
Then 


JA] = |Ay Ap --- An| = |U] + S—(-1)* 5% 
k=1 


Proof: First suppose that the element x € U lies in none 
of the sets A;. Then x is counted once on the left side of 
the equation, and it is counted in the formula on the right 
side only by the summand |U], and not by any summand 


Sh. 


Now suppose that x lies in exactly m of the subsets A;, 
with m > 0. Accordingly zx is not counted on the left side 
of the equation. On the right side, it is counted ("") times 
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by Sz, since there are Ce) ways to choose k sets from the 
m sets A; that contain x, and z is also counted once by 


|U|. Thus, its net count on the right side is 


Bor)" 





The other main incl-excl formula is derivable by the same 
approach, or, as shown here, as a corollary of Thm 3.6.1. 


Cor 3.6.2 [Include-All Equation for Set Size]. Let 
Aj,...,An be subsets of a set U, with 


A= Lae and SS, = Se AeA 
k=1 


11 ye0ytr E[1in] 


so that, forr =1,...,n, the number S,. is the sum of the 
cardinalities of all r-fold intersections of the family {A;}. 
Then 


mr 


JA] = So(-1) 15% 


k=1 


Proof: Observe that the universal set U is the disjoint 


union of the set A and the set A; Ay --- A,. Therefore, 
JU] = |A| + | Ar Az «+> An 


and, accordingly, 
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ia io) lag gee A | 
and then, by ‘Theorem 3.6.1, 


(| eee S(-0s) ig ee 


k=1 


mr 


> (-1)* 18s 0 


k=1 


Thm 3.6.3 provides an inclusion-exclusion formula for 
the sum of the values of an arbitrary function f:U > R 
on the domain U, not simply for counting the size of a set. 


Thm 3.6.3 [General Exclude-All Eq]. Let Aj,...,An 
be subsets of a set U, with 


A= |J Ax 
k=1 
and let f :U — R be a real-valued function. Let the sum 


S = >» Fle) 


tissest eel lil eA Ag. 


be taken over all r-fold intersections of the family {Ax}. 
Then 


~ f@ = Di@ + Devs. 


tEA, An xEU 
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Proof: The proof is a straightforward modification of 
the proof of the Exclude-All Equation for set sizes. o; 


In the remainder of this section, the two main inclusion- 
exclusion formulas are applied to various combinatorial 
problems. 


Stirling Subset Numbers 


Although there are various similarities between Stir- 
ling numbers and binomial coefficients, there is no known 
closed formula for Stirling numbers of either kind, unlike 
the situation for binomial coefficients. However, there is 
a summation formula for a Stirling subset number, whose 
derivation by inclusion-exclusion is our immediate objec- 
tive. ‘The ideas involved are encapsulated in the following 
example. 


Example 3.6.4: ‘The Stirling subset number ee is the 
number of ways to distribute a set of 5 objects into 4 
cells with no cell left empty. For a problem this small, 
listing cases is easy enough, but it is instructive to ap- 
ply inclusion-exclusion. Toward that objective, for 7 = 
1,2,3,4, let A; be the set of distributions with box 2 left 
empty. Clearly, 


Al SB eres 3-4 
JA;Aj] = 22 fort Aj 
|A;A;Ax| it for mutually distinct 7,7,k 


Moreover, 
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4 5 
2 ke (4—k) (3.6.1) 


since there are @ ways to choose k of the subsets A; from 
the collection of four such subsets, and each intersection 
Aj, Ai, +++ Ay, contains (4 — k)° objects. Furthermore, 


| Ay Ap As Ag| = vata (3.6.2) 


since each distribution with none of the boxes left empty 
amounts to assigning the labels 1,2,3,4 to the cells of a 
partition. Finally, if U is the set of all ways to distribute 
5 objects into 4 cells, we have 


jo) = (3.6.3) 
When we substitute into the Exclude-All Equation 
A; Az Az As | = |U| — 5; + Ss — $3 4+ S4 


the values from Equations (3.6.1), (3.6.2), and (3.6.3), we 
obtain the equation 


fi}e = eee Oe Ore Gy 


= 1024 — 972 + 192 —4 = 240 


5 240 
ee ea 
i it 4! 
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A confirming observation is that, since two of the elements 
are paired, and since the others have cells to themselves, 


clearly 
: D \ () 10 
4 2 


In a similar manner, an inclusion-exclusion analysis 
leads to an identity for the Stirling subset numbers 


as 


Proof of the following theorem simply generalizes the steps 
and calculations that we have just completed. 


Theorem 3.6.4. Let n and k be a pair of non-negative 
integers. Then 


Proof: For 7i=1,2,...,k, let A; be the set of distribu- 
tions of n distinct objects into k distinct boxes with box 2 
left empty. Clearly, 


|A;| = (k-1)” for S22. heek 
and, more generally, for any 7 € [1,k] 


|Ai, Ain Ay,| = (R—9)" for mutually distinct 21, 72,...,2; 
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Since there are @ ways to choose the mutually distinct 
21,22,--+,2;, and since 5S; is the sum of the numbers of 
ways to leave j specific boxes empty (with others possibly 
empty also), it follows, by analogy to Eq. (3.6.1), that 


S; = ()e- i)” (3.6.1) 


J 


Furthermore, 


Ay Ag emg | Le (3.6.2') 


since each distribution with none of the k boxes left empty 
amounts to assigning the labels 1,2,...,k to the cells of a 
partition. Finally, if U is the set of all ways to distribute 
n objects into k cells, we have 


eee (3) (k —0)" (3.6.3) 


Substituting the values from Equations (3.6.1’), (3.6.2), 
and (3.6.3) just above into the Exclude-All Equation 


|ALAg-- Ag| = |W] - Si: + Sp ++ + (IS: 


we obtain the identity 


(pH = Dy (Mya > 


Section 3.6 Inclusion-Exclusion 71 


Derangements 


Inclusion-exclusion is also helpful in analyzing the de- 
rangement recurrence. 


REVIEW FROM 82.1: 


e A derangement is a permutation 7 with no fixed 
points. 


e The derangement number D,, is the number of 
derangements of the integer interval [1 : n]. 


e The derangement recurrence (see also §5.4) is 


Do = 1, Dy, = 0; (3.6.4a) 
Da = (n—-1)Dp-1 + (n—1)Dp—2 (3.6.46) 


Example 3.6.5: In the classical hatcheck problem, 
each of n persons leaves a hat in the cloakroom, but the 
hatchecks are lost, and the n hats are redistributed ran- 
domly. It asks, what is the probability that no hat goes 
to its rightful owner? This problem is recognizable as a 
homespun version of calculating the proportion of permu- 
tations of n objects that are derangements. 


If U is the set of all possible hat distributions, then 
haa (3.6.5) 


To calculate the number D,, of derangements, let A; be 
the set of permutations in which hat 2 goes to its rightful 


72 Chapter 3 Evaluating Sums 


owner. Then 


Dy 


| 
S| 
. 


-An| (3.6.6) 


and 


|Ai| = (n—1)! 
More generally, 
aaa Scr) 
which implies that 
n 
| (") (n—k)! (3.6.7) 
When the Exclude-All Equation 
Ay Ay +++ An| = |[U| + $°(-1)*S; 
k=1 


is combined with Equations (3.6.5), (3.6.6), and (3.6.7), 


we obtain 


Dn = nt + iy ({) (a0) 
= ey (3.6.8) 


Substituting n = 0 and n = 1 into Eq. (3.6.8), we obtain 


0 0! 
Do = aa, 
k=0 ; 
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50! 


— (—1) i =— | 
and 
1 u 
Di = So(- a 
k=0 
1! 1! 
= ("5 + (-'G = 1+(-1) = 0 


which establishes that equation (3.6.8) satisfies the initial 
conditions (3.6.4a) of the derangement recurrence. More- 
over, assuming that equation (3.6.8) satisfies the recur- 
rence for D,_1 and D,y—2, we confirm by the following 
calculation that it also satisfies the recurrence for D,,. 


(n—1)Dp—1 + (n — 1I)Dy_-2 


n—-Ll 








= (n—- 1) > (9 a 7) + (n- 1) So-1) (n ma 
= y( yee yy" e 1 y(-1) tl) 
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Remark: We observe that 


yee (3.6.8) 


k=0 


Dy 


implies that 


ni 
k=0 


Thus, one might approximate the value of e~', and hence, 
of the number e, by generating random permutations and 
counting the proportion that are derangements. 


Counting Bipartite Matchings 


In the rest of this section, a prior acquaintance with 
graph theory would be helpful. 
REVIEW FROM 80.6: 


e A bipartite graph G is a graph whose vertex set can 
be partitioned into two subsets X and Y s.t. every 
edge has one vertex in X and the other in Y. 


PREVIEW OF 88.6: 


e A matching in a graph is a set of edges such that no 
two edges have an endpoint in common. 


e A perfect matching in a graph is a matching in 
which every vertex is the endpoint of one of the edges. 
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Example 3.6.6: In the bipartite graph G of Fig 3.6.4, 
there are five perfect matchings: 


one with f(1) =a _ (shown with thicker edges); 
three with f(1) = 8; 
and one with f(1) =. 


3 Cc 


A d 
Fig 3.6.4 Perfect matching in a bipartite graph. 


Let U be the total set of bijections X + Y. Then 
(U)| = AL 24 (3.6.9) 


A bijection X — Y is consistent with the bipartite graph 
G if it is representable as a perfect matching in G. For 
27=1,2,3,4, let A; be the number of bijections f : X 7 Y 
such that the assignment 7 ++ f(z) is inconsistent with the 
graph G, that is, such that vertex f(z) is not adjacent to 
vertex 2. 


For each choice of a vertex of Y that is not adjacent to 
vertex 7, there are 3! bijections X — Y, corresponding 
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to the 3! ways to assign the other 3 vertices of X to the 
remaining 3 vertices of Y. Thus, 


[Ail = (4 —deg(i)) 3! 
and there are similar formulas for multiple inconsistencies. 
|A,A2| = |A1,A3| = |A1A4] = |AgAa] = |AzgA4| = 2! 
and | Az A3| =D 
|A, A2A3| = |A, A3Aq| = |A2A3A4| = 1 and |A,A2Ag| = 0 


Se. 6p 802 So = aS, SS = 0-39.60) 
Therefore, by using (3.6.9) and (3.6.10) with the Exclude- 


All Equation, the number of perfect matchings is shown 
to be 


A; Az As Ag | |U| — $1 + Sz — $3 + Sa 
24 —- 30+ 14-340 


= 95 


which agrees with our ad hoc count at the outset. 


An alternative representation of this counting problem is 
the chessboard of Figure 3.6.5. Observe that a square is 
shaded if and only if it is forbidden to match the vertex 
corresponding to its row to the vertex corresponding to its 
column. Each perfect matching corresponds to a selection 
of one unshaded square in each row, such that there is at 
most one selection in each column. 
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Fig 3.6.5 Chessboard representation of a bipartite 
matching problem. 


Remark: The method given here is applicable also to 
counting complete matchings of a bipartite graph when 
one part has more vertices than the other. 


Chromatic Polynomials 


An algebraic invariant called the chromatic polyno- 
mial of a graph can be calculated by inclusion-exclusion. 


PREVIEW OF 88.3: 


e A vertex-coloring of a graph G in the set [1 : n], 
often simply called a coloring, is a function f : Vg > 
(1. ev) 

e A proper vertex-coloring of a graph is a color- 
ing such that no two adjacent vertices have the same 
color. 
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DEF: The chromatic polynomial P(G,,t) of a graph G 
is the function whose value at the integer ¢ is the number 
of proper colorings of G with at most t colors. 


As a preliminary to a more systematic approach, we con- 
sider an ad hoc construction of a chromatic polynomial. It 
illustrates why the function P(G,n) is a polynomial. 


Example 3.6.7: The graph Aj,2 requires at least two 
colors for a proper coloring. We observe that, given two 
colors, exactly two proper 2-colorings are possible, one of 
which is illustrated in Figure 3.6.6. We write pg = 2. 
Given three colors, exactly six 2-colorings (i-e., 3!) are 
possible, so we write p3 = 6. 


Fig 3.6.6 A proper 2-coloring for the bipartite graph Kj 2. 


For any positive integer t, the #4 ways to choose two colors 


is (5) and the number of ways to choose three colors is (es 


By Rule of Product and Rule of Sum, it follows that the 
number of proper colorings with t colors is 


—1 —1)(t-2 
- ) + (5) oe 
(t° —t) + (t° — 3? + 2t) 
Sf Sor ag (3.6.11) 
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In general, it may be computationally difficult to de- 
termine the exact numbers of proper colorings 


Pi Pp2 <'* Pn 


for an n-vertex graph G, or even to decide the chromatic 
number, i.e., the smallest positive value. However, what- 
ever those numbers may be, the chromatic polynomial is 


P(G,t) = n(5) + (5) andi + po") 


Example 3.6.7, cont.: ‘To recalculate the chromatic 
polynomial of the graph Ay,2 by inclusion-exclusion, its 
two edges are labeled 1 and 2, as shown in Figure 3.6.7. 


Fig 3.6.7 An edge-labeling for the bipartite graph kK, 2. 


Let U be the set of all colorings of Ay. with at most ¢ 
colors. Let A, be the set of such colorings in which the 
endpoints of edge 1 have the same color, and let Az be the 
set of such colorings in which the endpoints of edge 2 have 
the same color. ‘Then 


PUR a) = IU |= ALA 


This is a job for the Exclude-All Equation. Evidently, 
=e 
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To calculate |Ai|, we recognize that there are t possible 
choices of a color for both endpoints of edge 1, and then 
another t possible choices for the color of the remaining 
vertex. Clearly, this holds also for |A2|. Thus, 


|Ai| = |A2| = ¢° 
and S$; =|A;| + |Ae| = 20 


Any coloring in A, M Ag has the same color at both end- 
points of edge 1 and the same color at both ends of edge 2. 
Since these two edges share an endpoint, all three vertices 
of Ay.2 must have the same color. There are ¢ possible 
choices for this color. ‘Thus, 


| 
o 


|Ai Ag| = 
and So = |A; As| = 


We now complete the recalculation, 


| Ay Az] = |U| — Si + Sp 
SS, OP he (3.6.12) 


which agrees with (3.6.11). 


Example 3.6.8: ‘To calculate the chromatic polynomial 
P(K4,t) of the complete graph K, by inclusion-exclusion, 
label its its six edges with numbers 1,...,6, as shown in 
Figure 3.6.8. 
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a 


| 
Fig 3.6.8 An edge-labeling of the complete graph K4. 


Let U be the set of all colorings of Ky with colors in [1 : ¢]. 
Then 
wl = 


Next, let A; be the set of colorings in [1 : t] such that the 
endpoints of edge i have the same color. Then |A;| = ¢°, 
since there are t possibilities for the color of the endpoints 
of edge 7 and t possibilities for each of the other two ver- 
tices. Since there are 6 edges, it follows that 


a |A;| as | A2| See | Ao| = 6t° 


There are ) = 15 ways to choose a pair of edges, 7 and 
yj. For 3 of these pairs, edges 2 and 7 have no vertex in 
common, in which case a coloring in A;A; may use t colors 
for the endpoints of edge z and t colors for the endpoints of 
edge 7, yielding t? possibilities. For the 12 pairs of edges 
that have a vertex in common, there are t choices for the 
color of the three vertices in the union of their endpoint 


82 Chapter 3 Evaluating Sums 


sets and ¢ choices for the remaining vertex, again yielding 
t? possibilities. Thus, 


Sy = 15t? 


There are C) = 20 ways to choose three edges. Exactly 4 
of these 20 triples form a 3-cycle. There are t choices for 
the color of all three vertices in that 3-cycle and t choices 
for the remaining vertex. The other 16 edge-triples form 
a connected subgraph (a spanning tree) that contains all 
four vertices of K4, so all four must get the same color, 
for which there are t choices. Accordingly, 


S3 = 4t? + 16t 


A subset of four or more edges must contain all the vertices 
of Ky. It follows that 


6 
sa (S\t = 
6 
= (Sr = 6 
() 
tL=t 
6 


By the Exclude-All Formula, 


and S¢ 


P(K4,t) = |[U| — $1 + Sz — S3 + Sa — Ss + So 
= t* — 6? + 15¢? — (4¢7 +16t) + 15t — Gt + ¢ 
=f S56 lie = 6 
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Remark: There are circumstances where the Exclude-All 
Formula is an excellent way to calculate chromatic poly- 
nomials, which are not revealed by these small examples. 
For instance, it can be used to prove that the chromatic 
polynomial of any n-vertex tree is t(t —1)"—1. 


